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Abstract — This paper deals with the computational complexity 
of conditions which guarantee that the NP-hard problem of 
finding the sparsest solution to an underdetermined linear system 
can be solved by efficient algorithms. In the literature several such 
conditions have been introduced. The most well-known ones are 
the mutual coherence, the restricted isometry property (RIP), 
and the nullspace property (NSP). While evaluating the mutual 
coherence of a given matrix is easy, it has been suspected for some 
time that evaluating RIP and NSP is computationally intractable 
in general. We confirm these conjectures by showing that for 
a given matrix A and positive integer k, computing the best 
constants for which the RIP or NSP hold is, in general, NP- 
hard in the strong sense. These results are based on the fact that 
determining the spark of a matrix is strongly NP-hard, which is 
also established in this paper. 

Index Terms — Compressed Sensing, Computational Complex- 
ity, Sparse Recovery Conditions 



I. Introduction 

A CENTRAL problem in compressed sensing (CS), see, 
e.g., [1], [2], [3], is the task of finding a sparsest solution 
to an underdetermined linear system, i.e., 



min||a;||o s.t. Ax 



(Po) 



for a given matrix A g R mx ™ with m < n, where ||x||o 
denotes the £o-q uas i-norm, i.e., the number of nonzero entries 
in x. This problem is well-known to be NP-hard, cf. [MP5] 
in [4]; the same is true for the variant with Ax = b replaced 
by \\Ax — &H2 < e [5]. Thus, in practice, one often resorts 
to heuristics. One of the most popular approaches is known 
as basis pursuit (BP) or l\ -minimization, see, e.g., [6], where 
instead of (Po) one considers 



mm \\x h 



s.t. Ax = b. 



(Pi) 



Here, ||a:||i = SlLil^l denotes the £i-norm. It can be 
shown that under certain conditions, the optimal solutions 
of (Po) and (Pi) are unique and coincide. In this case, 
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one says that ^-^-equivalence holds or that the ^-solution 
can be recovered by the ^-solution. Similar results exist for 
the sparse approximation (denoising) variants with constraint 

||j4ss-&||a<e. 

Many such conditions employ the famous restricted isome- 
try property (RIP) (see [7] and also [8], [9]), which is satisfied 
with order k and a constant 5 k by a given matrix A if 



(l-S k )\\x\\ 2 2 <\\Ax\\l<(l + S k )\\x\ 



(1) 



holds for all x with 1 < ||a;||o < k. One is usually interested 
in the smallest possible constant 6_ k , the restricted isometry 
constant (RIC), such that (1) is fulfilled. For instance, if 
5_ 2 k < — 1, all x with at most k nonzero entries can be 
recovered (from A and b := Ax) via basis pursuit, see, e.g., 
[10]. Several probabilistic results show that certain random 
matrices are highly likely to satisfy the RIP with desirable 
values of S_ k , see, for instance, [11], [9]. There also has 
been work on deterministic matrix constructions aiming at 
relatively good RIPs, see, e.g., [12], [13], [14]. The RIP also 
provides sparse recovery guarantees for other heuristics such 
as Orthogonal Matching Pursuit and variants [15], [16], as 
well as in the denoising case where the right hand side b 
has been slightly perturbed and one employs the constraint 
\\Ax - &|| 2 < £, see, e.g., [10]. 

In the literature, it is often mentioned that evaluating the 
RIP, i.e., computing the constant 6_ k for some A and k, 
is presumably a computationally hard problem. Most papers 
seem to refer to NP-hardness, but this is often not explicitly 
stated. This motivated the development of several (polynomial- 
time) approximation algorithms for S_ k , e.g., the semidefinite 
relaxations in [17], [18]. However, while a widely accepted 
conjecture in the CS community, NP-hardness has, to the best 
of our knowledge, not been proven so far. 

Recently, some first results in this direction have been 
obtained: In [19], hardness results about the RIP were de- 
rived from the complexity of dense subgraph problems, under 
certain (non-standard) assumptions. In work independent from 
this paper, [20] shows that it is (weakly) NP-hard to verify (1) 
for given A, k and 5 k £ (0, 1), see Section III for a discussion. 

Another popular tool for guaranteeing £q-Ii -equivalence 
is the nullspace property (NSP), see, e.g., [21], [22], [23], 
which characterizes recoverability by (Pi) (in fact, via t p - 
minimization with < p < 1, see [24], [25]) for sufficiently 
sparse solutions of (Po). The NSP of order k is satisfied with 
constant a k if 

NU,i < a fe ||a:||i ( 2 ) 
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holds for all vectors x in the nullspace of A (i.e., Ax = 0), 
where ||£c||fe,i denotes the sum of the k largest absolute values 
of entries in x. Similar to the RIP case, one is interested 
in the smallest constant a k , the nullspace constant (NSC), 
such that (2) is fulfilled. Indeed, if and only if a k < 1/2, 
(Pi) with b := Ax, \\x\\ < k, has the unique solution x, 
which coincides with that of (P ); see, e.g., [21], [22], [26]. 
Moreover, error bounds for recovery in the denoising case can 
be given, see, e.g., [23]. 

Again, the computation of a k is suspected to be NP-hard 
and several heuristics have been developed to compute good 
bounds on a k , e.g., the semidefinite programming approaches 
in [27], [28], or an LP-based relaxation in [26]. However, as 
far as we know, no rigorous proof of (NP-)hardness has been 
given. 

In this paper, we show that it is strongly NP-hard to 
compute the RIC and NSC of a given matrix with given k; 
see Sections III and IV, respectively. More precisely, we show 
that unless P=NP, there is no polynomial time algorithm that 
computes S k or a k for all given instances (A,k). We also 
prove that certifying the RIP given A, k and some S k € (0, 1) 
is CoNP-complete in the strong sense; this is slightly stronger 
than the result from [20] mentioned above: Recall that strong 
NP-hardness implies that there cannot exist a fully polynomial- 
time approximation scheme (FPTAS), i.e., an algorithm that 
solves a minimization problem within a factor of (1+e) of the 
optimial value in polynomial time with respect to the input size 
and 1/e; an FPTAS often exists for weakly NP-hard problems. 

Prior to this, in Section II, we prove NP-hardness in the 
strong sense of computing the spark of a matrix, i.e., the 
smallest number of linearly dependent columns. In fact, our 
main results concerning the complexity of determining the 
RIC or NSC follow from reduction of a decision problem 
concerning the existence of inclusion-wise minimal such col- 
umn subsets. The term spark was first defined in [29], where 
strong results considering uniqueness of solutions to (Po) were 
proven. Ever since, its value has been claimed to be NP-hard to 
calculate, but, to the best of our knowledge, without a proof or 
reference for this fact. It seems to have escaped researchers' 
notice that [30] contains a proof that deciding whether the 
spark equals the number of rows is weakly NP-hard, by 
reduction from the Subset Sum Problem (cf. [MP9] in [4]). 
Even earlier, in [31], the authors claim to have a proof, but give 
credit to the dissertation [32] for establishing NP-hardness 
of spark computations. However, after closer inspection, the 
result in [32] is in fact not about the spark, but the girth of so- 
called transversal matroids of bipartite graphs. We adapt the 
proof idea from [32], a reduction from the fc-Clique Problem, 
to vector matroids and thus establish that spark computation 
is strongly NP-hard; see Section II for more details. 

Instead of the intractable RIP, NSP, or spark, the weaker 
but efficiently computable mutual coherence [21] is sometimes 
used. It can be shown that the mutual coherence yields bounds 
on the RIC, NSC, and the spark; see, for instance, [24], [33]. 
Thus, imposing certain conditions involving the mutual coher- 
ence of a matrix can yield uniqueness and recoverability (by 
basis pursuit or other heuristics), see, e.g., [33]. However, the 
sparsity levels for which the mutual coherence can guarantee 



recoverability are quite often too small to be of practical use. 

Throughout the article, for an m x n matrix A and a subset 
S C {1, . . . , n}, we denote by As the submatrix of A formed 
by the columns indexed by S. Similarly, x$ denotes the part 
of a vector x containing the entries indexed by S. By A T 
and x T , we denote the transpose of a matrix A or vector 
x, respectively. For graph theoretic concepts and notation we 
refer to [34], for complexity theory to [4], and for matroid 
theory to [35]. 

II. Complexity issues related to the spark 

In this section, we deal with complexity issues related 
to linearly dependent columns of a given matrix A G 
Qmxn i nc i us i on - W i se minimal collections of linearly depen- 
dent columns are called circuits. More precisely, a circuit is 
a set C C {1, . . . , n} of column indices such that Aqx = 
has a nonzero solution, but every proper subset of C does not 
have this property, i.e., rank(Ac) = \C\ — 1 = rSink(A c \^) 
for every j € C. For the sake of notational simplicity, we 
will sometimes identify circuits C with solutions x e R" of 
Ax = having support C. The spark of A is then the size of 
its smallest circuit. 

As mentioned in the introduction, the computation of the 
size of a smallest circuit was already shown to be (weakly) 
NP-hard by [30, Theorem 1]. On the other hand, in [32] it 
was shown that computing a smallest circuit of a transversal 
matroid is (strongly) NP-hard. Only recently, a variant of 
the latter proof has resurfaced in [36], where it is used to 
derive complexity results for constructing so-called full spark 
frames, i.e., matrices exhibiting the highest possible spark. 
Every transversal matroid can be represented by a matrix 
over any infinite field or finite field with sufficiently large 
cardinality [37], but there is no known deterministic way to 
construct such a matrix. 

We present a modification of the aforementioned proof 
in the following — see Remarks 1 and 2 below for more 
discussion of the differences to [32] and [30]. 

Theorem 1: Given a matrix A e Q mx ™ and a positive 
integer k, the problem to decide whether there exist a circuit 
of A of size at most k is NP-complete in the strong sense. 

The following auxiliary results are used in our proof: 

Lemma 1: The vertex-edge incidence matrix of an undi- 
rected simple graph with N vertices, B bipartite components, 
and Q isolated vertices has rank N — B — Q. 

This result seems to be rediscovered every once in a while. 
The earliest proof we are aware of is due to van Nuffelen [38] 
and works through various case distinctions considering linear 
dependencies of the rows and consequences of the existence 
of isolated or bipartite components. 

Lemma 2: Let G = (V, E) be a simple undirected graph, 
without isolated vertices, with vertex set V and edge set E. Let 
A = A(G) be its vertex-edge incidence matrix, and let k > 4 
be some integer. Suppose G only has connected components 
with at least four vertices each, \E\ = (^j, and rank(A) = k. 
Then the graph G has exactly \V\ — k vertices. 

Proof: Let G = (V, E) and k > 4 be the simple graph 
and integer given in the statement of the lemma. Assume that 
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G has no component with less than 4 vertices, has \E\ = (Jj) 
edges, and that its incidence matrix A has rank(A) = k. 

Since G has no isolated vertices, Lemma 1 tells us that the 
number of vertices is 

N = rank(A) + B = k + B, 

where B is the number of bipartite components in G. Assume 
that B > 0, since otherwise the lemma is trivially true. 
We claim that the number of edges in G can be at most 



\E\ < 



4(iV - 4) 



B -2B. 



(3) 



To see this, recall that G can have at most (^) edges. Each 
connected component has at least four vertices. Since there 
are no edges between such a component and vertices outside, 
the total number of possible edges is reduced by at least 
4(7V — 4)/2 per component (the factor 1/2 ensures that we do 
not count any edges twice). Since G has at least B connected 
components, the possible number of edges is hence decreased 
at least by the second term. Moreover, since each bipartite 
component has at least four vertices, at least two of the 
potential edges cannot be present inside each such component, 
which yields the last term. Note that the bound (3) is sharp 
if G consists only of bipartite components with four vertices 
each. 

Expanding (3) using N = k + B, we obtain 

'N\ A(N - 4) 

2 J 2 
(k + B)(k + B-l) 4(fc + B - A) 



\E\ < 



-B — IB 



-B-2B 



ifc 2 



2 k 



kB 



k 

and observe that 



{kB + \B 2 — -y B), 



kB + ^B 2 -^B>5B + ^B 2 -^B = ^B 2 -^B>0 

if B > 0. Thus, there are strictly less than (*) edges, 
contradicting the requirement \E\ = (^)- Hence, B = 0. ■ 
Proof of Theorem 1: The problem is clearly in NP: 
Given a subset C of column indices of A, it can be verified in 
polynomial time that \C\ < k and that rank(Ac) = |C| — 1 = 
Y&nk(A C \{j}) for every j e C (by Gaussian elimination, see, 
e.g., [39]). 

To show hardness, we reduce the strongly NP-complete k- 
Clique Problem (cf. [GT19] in [4], or [40]): Given a simple 
undirected graph G, decide whether G has a clique, i.e., a 
vertex-induced complete subgraph, of size k. We may assume 
without loss of generality that k > 4. 

For the given graph G with n vertices and m edges construct 
a matrix A = (aj e ) of size (n + (*) — k — 1) x m as 
follows: Index the first n rows of A by the vertices of G 
and its columns with the edges of G. Let the first n rows 
of A contain the vertex-edge incidence matrix of G (i.e., set 
a ie = 1 if i e e, and otherwise). For the non-vertex rows 
n + j, j € {1, • • • , (2) - k - 1}, set a {n+j)e = 2 e ^ (here 
we identify the vertices and edges with their indices); note 



that this corresponds to the bottom part of A consisting of 
a Vandermonde matrix (each column consists of increasing 
powers of the distinct numbers 2 1 , . . . , 2 m ) without the usual 
all-ones first row. Clearly, this matrix A can be constructed 
in polynomial time, and its encoding length is polynomially 
related to that of the input. 

We first show that G has a fc-clique if and only if A has a 
circuit of size (Jj) . Suppose that G has a fc-clique, k > 4, say 
on the vertices in the set R (so that \R\ = fc), and with its ( 2 ) 
edges in the set C. Since Ac has all-zero rows for each vertex 
outside of R, \R\ + (number of non- vertex rows) = ( 2 ) — 1 = 
\C\ — 1. Clearly, a clique is never bipartite (it always contains 
odd cycles, for fc > 3). Hence, by Lemma 1, the rows of Ac 
indexed by R are linearly independent. The lower part of A 
was chosen Vandermonde-like, so all its submatrices have full 
rank; in particular, the one containing only columns from C 



has full row-rank ( 2 ) 



1. Moreover, every row of Ac 



indexed by R is linearly independent from the non-vertex rows 
of Ac- (This can be seen inductively: Since fc > 4, there are 
always less than |C| — 4 non-vertex rows in Ac- Thus, these 
rows along with any two unit vectors remain linearly inde- 
pendent. Replacing these two unit vectors by their sum, again 
leaves a linearly independent set of rows. We can continue 
this way, adding different unit vectors to the sum, and see that 
every 0/1-vector — in particular those that are the rows of Ac 
indexed by R — is linearly independent from the non-vertex 
rows of A c .) Thus, rank(A c ) = k + (fj) - k - 1 = \C\ - 1. 
Now observe that removing any edge from a fc-clique does 
not affect the rank of the associated incidence matrix, since 
the subgraph remains connected and non-bipartite with less 
vertices than edges (for k > 4). Thus, by Lemma 1, the rank 
of the nonzero vertex row part of Ac remains fc if any column 
from C is removed. Clearly, the rank of the lower part of Ac 
stays the same as well, as there still are more columns than 
rows. Thus, r&nk(A C \{ e }) = rank(Ac) = |C| — 1 for every 
e e C, i.e., C is a circuit. 

Conversely, suppose that A has a circuit C of size \C\ = (Jj) 
with k > 4. Then, by definition of a circuit, rank(Ac) = 
\C\ — 1, so Ac has at least \C\ — 1 nonzero rows. Since these 
include the \C\ — k— 1 non-vertex rows, the set R of nonzero 
vertex rows of A c has size \R\ > (|C|-l) - (|C|-fc-l) = fc. 
Moreover, the non-vertex part of Ac has full rank and, as seen 
above, is linearly independent from any vertex row of Ac- 
Thus, the upper part of Ac must have rank exactly fc. 

Observe that the subgraph (R, C) of G with vertex set R 
and edge set C cannot contain components with less than 4 
vertices: such a subgraph (R 1 , C') could have at most as many 
edges as vertices, so that the associated incidence matrix A' c , 
has full column rank. Removing a column corresponding to 
an edge e e C' would reduce the rank, i.e., vank(A' c ,^ e y) < 
rank(AJ 7 ,). In particular, removing any column from Ac does 
not affect the rank contribution of the non-vertex row part 
of A, so that removing e would also yield rank(A c \{ e }) = 
rank(Ac) — 1, contradicting the fact that C is a circuit. Thus, 
Lemma 2 applies to the graph (R, C) and yields that \R\ = fc. 
This implies that the vertices in R form a fc-clique, because R 
can induce at most (Jj) edges and the (Jj) edges in C are 
among them. 
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We now show that each circuit of A has size at least ( 2 ). 
This proves the claim, since by the arguments above, it shows 
that there exists a circuit of size at most (in fact, exactly) ( 2 ) 
if and only if G has a fc-clique, i.e., for the given construction 
a solution to the spark problem yields a solution to the clique 
problem as well. 

Suppose that A has a circuit C with c := \C\ < ( 2 ). Let 
d = (2) — c > 0. Since C is a circuit, Ac has c — 1 nonzero 
rows (similar to the arguments above). Because the ( 2 ) — k — 1 
non-vertex rows are among these, and by Lemma 2, Ac has 
(c— 1)— (( 2 ) — k— l) = k — d nonzero vertex rows. Denote the 
set of such rows by R, and let r be the number of edges in the 
subgraph of G induced by the vertices in R. Since \R\ = k — d 
vertices can induce at most ( 2 ) edges, r < ( 2 ). But 
surely all the edges in C are among those induced by R, 
so that r > c = ( 2 ) — d. Putting these inequalities together 
yields (*) - d < r < ( k ~ d ). However, since we assumed 
k > 4, it holds that ( 2 ) — d > ( 2 )> yielding a contradiction. 
Consequently, every circuit C of A must satisfy |C| > ( 2 ). ■ 

The smallest size (cardinality) of a circuit can be expressed 

as 

spark(A) := min{ ||a;||o | Ax = 0, x ^ 0}. (4) 

Clearly, there exists a circuit of size at most k if and only if 
the spark is at most k. Hence, Theorem 1 immediately yields 
the following. 

Corollary 1: Computing spark(^l) is strongly NP-hard. 

Remark 1: The idea of reducing from the clique problem to 
prove Theorem 1 is due to Larry Stockmeyer and appears in 
Theorem 3.3.6 of [32] (see also [31], [36]). However, [32] uses 
generic matrices that, in fact, represent transversal matroids (of 
bipartite graphs) and therefore have certain properties needed 
in the proof. The entries of these generic matrices are not 
specified, and to date there is no known deterministic way to 
do so such that the matrix represents a transversal matroid. We 
replaced the corresponding machinery by our explicit matrix 
construction and the arguments using Lemmas 1 and 2 to 
become independent of transversal matroid representations and 
work directly on vector matroids. For clarity, we split the 
result into Theorems 1 and Corollary 1. Note that the proof 
of Theorem 1 also shows NP-completeness of the problem to 
decide whether A has a circuit C with |C| = k. 

Remark 2: The results above are related to, but different 
from, the following. 

1) Theorem 1 in [30] shows that for an m x n matrix A, it 
is NP-hard to decide whether A has anmxra submatrix 
with zero determinant. This implies NP-hardness of 
computing the spark for the special case of the spark 
being equal to the row number of A. This restriction can 
in principle be removed, by appending all-zero rows to 
the matrix from the proof in [30], but one would then 
no longer be in the interesting case where the matrix has 
full (row) rank. 

Note that the reduction in [30] is from the Subset Sum 
Problem, so the derived NP-hardness holds in the weak 
sense. Our reduction from fc-Clique yields NP-hardness 
in the strong sense. Moreover, our proof admits spark 



values other than the row number. However, the row and 
column numbers in the reduction depend on the instance. 

2) In contrast to the results above, for graphic matroids, the 
girth can be computed in polynomial time [41], [32]. 

3) The paper [42] proves NP-hardness of computing the 
girth of the binary matroid, i.e., a vector matroid over 
F2. This, however, does not imply NP-hardness over the 
field of rational or real numbers, and the proof cannot 
be extended accordingly. Similarly, in [43] it was shown 
that, over F 2 , it is NP-complete to decide whether there 
exists a vector with k nonzero entries in the nullspace 
of a matrix. However, while this proof straightforwardly 
extends to the rational case, the result does not imply 
hardness of computing the spark either: While having 
spark = k! < k implies that there also is a circuit of 
size k, it does not guarantee that there also is a vector 
with k nonzeros in the nullspace. Furthermore, in [43] 
there is no lower bound on the spark as in Theorem 1 . 

We also have the following result, which shows another 
relation between minimum cardinality circuits and the task of 
finding sparsest solutions to underdetermined linear systems. 

Corollary 2: Given a matrix B, a specific column b of B, 
and a positive integer k, the problem of deciding whether there 
exists a circuit of B of size k which contains b is NP-complete 
in the strong sense. Consequently, it is strongly NP-hard to 
determine the minimum cardinality of circuits that contain a 
specific column b of B. 

Proof: Denote by A the matrix B without the column b. 
Then it is easy to see that B has a circuit of size k that contains 
b if and only if Ax = b has a solution with k — 1 nonzero 
entries. Deciding the latter is well-known to be NP-complete 
in the strong sense, see [MP5] in [4]. Moreover, computing 
the smallest possible size of those circuits that include b is 
equivalent to solving min{ ||x||o | Ax — b}. But this is 
precisely (Po), which is well-known to be strongly NP-hard. 

■ 

Remark 3: As mentioned in [29], one can reduce spark 
computations to (Po) as follows: For each column of A e 
Qmxn m tum ^ a( j ( j a new row w jjjj a 2 in this column and 

elsewhere. The right hand side b is the m-th unit vector. 
Now solve each such (P ) problem, and take the solution 
with smallest support. Interestingly, we do not know an easy 
reduction of the reverse direction (other than the very general 
reduction, because both associated decision problems are NP- 
complete). 

To conclude this section, we briefly consider so-called full 
spark frames. Anmxti matrix A with full rank m (m < n) is 
said to be full spark if spark(A) = m+1, i.e., every submatrix 
consisting of at most m columns of A has full rank. In [36], 
it was shown that testing a matrix for this property is hard 
for NP under randomized reductions. In fact, however, the 
following stronger result holds: 

Corollary 3: Given a rational matrix A, it is CONP- 
complete to decide whether A is a full spark frame. 

Proof: We can assume w.l.o.g. that A e Q mx ™ with rank 
m < n. Thus, A is full spark if spark(A) = m+1. Clearly, 
spark(A) = m+1 holds if and only if the question whether 
A has a singular m x m submatrix has a negative answer. 
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Since the latter decision problem is (weakly) NP-complete by 
[30, Theorem 1], deciding whether A is a full spark frame is 
CoNP-hard. 

Moreover, this problem is contained in CONP, since the "no" 
answer can be certified in polynomial time by specifying a 
singular (square) submatrix and because checking singularity 
can be done in polynomial time. ■ 
Note that for the above proof, we cannot employ Theorem 1 
(to obtain hardness in the strong sense), because this would 
require considering the matrix construction used in the reduc- 
tion from the clique problem (see proof of the theorem) with 
k = n—l, since in that case the matrix has ( 2 ) = n+( 2 ) — k— 1 
rows and we show hardness for asking whether the spark is 
at most ( 2 ). However, the clique problem can be solved in 
polynomial time for any constant k, and in the above-described 
scenario, k would indeed no longer be part of the problem 
input. 

Remark 4: Above, and in the related complexity results to 
follow, we show that the decision problem under consider- 
ation has a negative answer if and only if a known NP- 
complete problem has a positive answer. Since by definition 
the complementary problem of an NP-complete problem is 
CO NP-complete, the respective hardness results follow — see 
also [4]. Both NP- and CoNP-completeness imply that no 
polynomial time algorithm exists unless P=NP (or equiva- 
lently P=C0NP). Since a problem is NP-hard if and only if it 
is CoNP-hard (every problem in CONP can be Turing-reduced 
to it; cf [34, Ch. 15.7]), we use the term NP-hard throughout 
the article. 

In the next sections we shall see how we can deduce 
NP-hardness of computing restricted isometry or nullspace 
constants from the above results. 

III. NP-HARDNESS OF COMPUTING THE RESTRICTED 
ISOMETRY CONSTANT 

Many recoverability results are based on the restricted 
isometry property (RIP) of a given matrix A. Recall that 
for a positive integer k, A satisfies the RIP of order k with 
constant Sk if (1) holds. Given A and k, the smallest such 
constant is the RIC 5 k . Note that (1) holds for Sk = if and 
only if A is orthogonal and that Sk < is impossible. 

A popular result states that if the RIP of order 2k is satisfied 
with some constant 5 2 k < V%— 1 then ^?o l -equivalence holds 
(i.e., the solutions of (Pi) and (P ) coincide) if the optimal 
solution to (Po) has at most k nonzeros, see [10]. A series 
of papers has been devoted to developing conditions of this 
flavor, often by probabilistic analyses. One of the most recent 
results, see [44], shows that £q-£\ -equivalence for fc-sparse 
solutions already holds if Sk < 0.307 is feasible (note that the 
RIP order is k, not 2k as above). Similar conditions exist for 
the denoising problem and greedy algorithms, see, e.g., [10], 
[15]. 

Ever since the RIP has been introduced, it has been re- 
marked that its evaluation is suspected to be computationally 
intractable, based on the observation that a brute-force method 
would have to inspect all submatrices induced by column 
subsets of sizes up to k. Of course, by itself, this does 



not generally rule out the possible existence of an efficient 
algorithm. However, we show below that (given A and k) 
deciding whether there exists a constant 5 k < 1 such that (1) 
holds is CoNP-complete in the strong sense. Thus, unless 
P=C0NP, which is equivalent to P=NP, there exists no 
polynomial time algorithm to answer this question. 

We need the following technical result in the following. 

Lemma 3: Let A = (ay) G Q mx " be a matrix. Define 

Anax := max{|a i:) | | i G {1, ...,m}, j G {1, ...,n}} 

and C := S^t^-v^)!. Then A := ^A satisfies 

\\Ax\\% < (1 + S)\\x\\l for all x G R™ and S > 0. 

Moreover, the encoding length of A is polynomial in that 
of A. 

Proof: First, observe that the largest singular value of A, 
Cmax(A), satisfies 

O-max(A) = ||A|| 2 < A max y/mn 

< 2 r io S2 (-Amax v^m)l _ (J (5) 

It follows that 

HAxlH^AIIlHl! 

<^^\\A\\l\\x\\l = \\x\\l<{l + 5)\\x\\l 

for any 8 > 0. Moreover, the encoding length of C and of A 
is clearly polynomially bounded by m, n, and the encoding 
length of A. ■ 
By the singular value interlacing theorem (see, e.g., [45]), 

^"max 

(A) is an upper bound for the largest singular value 
of every submatrix of A. Thus, the above lemma essentially 
shows that by scaling the matrix A, one can focus on the 
left part of the RIP (1) (a similar argument has been derived 
independently in [20]). This leads to the following complexity 
result. 

Theorem 2: Given a matrix A G Q mx ™ and a positive 
integer k, the problem to decide whether there exists some 
constant Sk < 1 such that A satisfies the RIP of order k with 
constant Sk is strongly CoNP-complete. 

Proof: We first show that the problem is in CONP. To 
certify the "no" answer, it suffices to consider a vector x with 
1 < || a; || o < k that tightly satisfies (1) for S k = 1. This 
implies Ax = 0. Clearly, since x is contained in the nullspace 
of A, we can assume that x is rational with encoding length 
polynomially bounded by the size of A. Then, we can verify 
\\ x \\o < k and Ax = in polynomial time, which shows that 
the "no" answer can be certified in polynomial time. 

To show hardness, we reduce the problem to decide whether 
there exists a circuit of size at most k, which is strongly NP- 
complete by Theorem 1 . Consider the matrix A as defined in 
Lemma 3; note that the circuits of A and A coincide, since 
nonzero scaling does not affect linear dependencies among 
columns. We claim that there exists a circuit x with \\x\\o < k 
if and only if A violates (1) for all Sk < 1. Since deciding 
the former question is NP-complete, this completes the proof. 

Clearly, if such an x exists, then 

(1-4) < \\Ax\\ 2 2 = 



6 
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implies that we must have 5k > 1. 

For the converse, assume that there does not exist 5k < 1 
for which (1) holds. By Lemma 3, the right part of (1) is 
always satisfied. Consequently, there must exist a vector x 
with 1 < ||a:||o < k such that the left part is tight, i.e., 

o > (i - J fc ) = > o. 

This implies that A x = 0. Thus, there also exists a circuit x 
with ||x||o < k, which shows the claim. ■ 
Usually, one is interested in the smallest constant 5k for 
which (1) holds, i.e., the restricted isometry constant (RIC) 



5u := min 5 

8>0 



(6) 



s.t. (1 - 5)\\x\\l < \\Ax\\l < (l + 5)\\x\\l 
for all x with ||x||o < k. 

Recall that 5 < is impossible, resulting in the condition 
5 > in (6). We immediately obtain the following complexity 
result. 

Corollary 4: For a given matrix A G <Q mx " and positive 
integer fc, it is strongly NP-hard to compute the restricted 
isometry constant 5 k . 

Proof: By definition, 5_ k < 5 k for any constant 5k such 
that A satisfies the RIP of order k with constant 5k- But 
Theorem 2 shows that deciding whether there exists a constant 
5k < 1 is CoNP-complete in the strong sense. Computing 5 k 
answers that decision problem and hence must be strongly 
NP-hard. ■ 

A. Asymmetric restricted isometry constants 

It has been remarked in [9] that the symmetric nature of 
the RIP can be overly restrictive. In particular, the influence 
of the right inequality in (6) is often stronger, although the left 
inequality is more important in the context of sparse recovery. 
For instance, the often stated condition 5 2k < 1 for uniqueness 
of fc-sparse solutions (see, e.g., [10]) should in fact read 5% k < 
1, where 



5 k := min <5 s.t. (1 

<5>0 



\i < \\Ax ll2 



- Vx : \\x\\ < k 
(7) 

is the lower asymmetric restricted isometry constant [9] (see 
also [46]). Correspondingly, the upper asymmetric RIC is 
given by 

6%:= min 5 s.t. (1 + 5)\\x\\ 2 2 > \\Ax\\ 2 2 Vx : \\x\\ < k. 

5>0 

(8) 

The central argument in the proof of Theorem 2 in fact 
shows the following: 

Corollary 5: Given a matrix A e Q mx ™ and a positive 
integer k, it is strongly NP-hard to compute 5 k . 

In contrast, the computational complexity of computing the 
upper asymmetric RIC 5% seems to be unknown. Note that 
1 + 5% is always bounded above by the largest singular value 
c m ax(^4) of A. However, rewriting (8) as 

6£ = max{ A max (Aj A s ) | S C {1, . . . , n}, \S\ < k } - 1, 

it is clear that this bound generally needs not be tight. 



B. RIP certification 

In this section, we show that the RIP certification problem, 
i.e., deciding whether a matrix A satisfies the RIP with given 
order k and given constant 5 k e (0, 1), is CoNP-complete. The 
main arguments used in the proofs of the following Lemma 
and Corollary have been independently derived by us and the 
authors of [20]. 

The following observation is essential. 

Lemma 4: Given a matrix A e <Q mx ™ and a positive 
integer k, if spark(A) > k, there exists a constant e > 
with encoding length polynomially bounded by k and that of 
A such that \\Ax\\% > e \\x\\% for all x with ||x|| < k. 

Proof: Assume without loss of generality that A has only 
integer entries (this can always be achieved by scaling with 
the least common denominator of the matrix entries, which 
influences e by a polynomial factor only). 

Define A max as in Lemma 3. Note that spark(A) > k 
implies that every submatrix As with S C {1, . . . , n}, \S\ < 
k, has linearly independent columns. Consider an arbitrary 
such S. Then, Aj,As is positive definite, so its smallest eigen- 
value fulfills X m i n (AgA s ) > and also det(AgAs) > 0. 
Moreover, since the absolute values of entries of A are integers 
in {0, 1, ... , Amax}, the entries of Ag A s are also integral and 
lie in {0, 1, . . . , mA 2 na:x }. Therefore, it must in fact hold that 
det(AjA s ) > 1 and A max (AjA s ) > 1. It follows that 

|S| 

l<dct(A T s A s )=l[K(AjA s ) 



111 A m i n 



<X min (A^A s ) (\S\- max |( 

< A min (AjA s ) (kmA^f^ 1 
Consequently, we have that 

1 



AlA 



S A s)io\ 



fe-1 



[km A 



2 

max 



ifc-1 



=: e > 0. 



(9) 



Since S was arbitrary, ||Aa;||2 > Xmin(Aj A s )\\x\\l > 
s\\x\\ 2 for all x with support S C {1, ...,n}, |5| < k. 
Moreover, the encoding length of ^l max , and therefore that 
of e, is clearly polynomially bounded by k and the encoding 
length of A, which completes the proof. ■ 

Corollary 6: Given a matrix A e Q mx ™, a positive integer 
k, and some constant 5k € (0, 1), it is CoNP-hard in the strong 
sense to decide whether A satisfies the RIP of order k with 
constant 5k- 

Proof: Consider a matrix A as in Lemma 3, so we can 
again focus on the left inequality of the RIP. Clearly, if A has 
a circuit of size at most k, A cannot satisfy the RIP of order k 
with any given 5 k E (0, 1), since in this case, H-Ax^ = < 
(1 — 5fe)||x||2 for some x with ||x||o < k. Moreover, Lemma 4 
shows that if A has no circuit of size at most k, A satisfies 
the RIP or order k with constant 1 — e € (0, 1), where e has 
size polynomially bounded by k and that of A, cf. (9). By 
Theorem 2, it is strongly CoNP-complete to decide whether 
there exists a constant 5 k < 1 such that A satisfies the RIP of 
a given order k with this constant. But as seen above, such a 
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constant exists if and only if A satisfies the RIP of fc with the 
constant 1 — e, too. Thus, deciding whether the RIP holds for 
a given matrix, order, and constant, is strongly NP-hard. ■ 
Remark 5: It is unclear whether the problem in Corollary 6 
is in CONP: It does not seem easy to show that there exists a 
certificate x for the "no" answer with 1 < ||x||o < k that 
violates the RIP with constant 6k and has a polynomially 
bounded encoding length. 

IV. NP-HARDNESS OF COMPUTING THE NULLSPACE 
CONSTANT 

Similar to the RIP, the nullspace property (NSP) allows to 
guarantee £ -^i-equivalence. Recall that A £ Q mx ™ satisfies 
the NSP of order k with constant a k if (2) holds for all 
x £ R™ with Ax = 0. The nullspace property guarantees 
exact recovery of fc-sparse solutions to (P ) by solving (Pi) 
whenever (2) holds with some constant a k < 1/2. More 
precisely, a k < 1/2 is feasible for (2) if and only if for all x 
with ||£||o < k, x is the unique solution to (Pi) with A and 
b := Ax, see [22]. (In fact, this implies solution uniqueness 
for min{||a?||^ | Ax = b} for any < p < 1, see [24], 
[25].) In particular, x must then also be the unique solution 
to the corresponding (P )-instance, i.e., €o i -equivalence for 
fc-sparse solutions holds. 

As for the RIP, one is typically interested in the smallest 
constant a k , the nullspace constant (NSC), such that the NSP 
of order k holds with this constant. Thus, the NSC is given by 

a fc := min a s.t. ||a:||fc,i < a||x||i Vx:Ax — 0, (10) 
or equivalently, 

a k := max ||x s ||i (11) 
s.t. Ax = 0, ||x||i = 1, SC{l,...,n}, \S\ < k. 

However, the following results show that computing a k is a 
challenging problem. 

Theorem 3: Given a matrix A £ Q mx ™ and a positive 
integer k, the problem to decide whether A satisfies the NSP 
of order k with some constant a k < 1 is CoNP-complete in 
the strong sense. 

Proof: First of all, note that (2) is equivalent to the 
condition that 

Mi < Nil d2) 

holds for all S C {l,...,n}, \S\ < k, and x £ R n with 
Ax = 0. Clearly, (2) and (12) always hold for some a k £ 

[0,1]. 

We first show that the problem is in CONP. To certify the 
"no" answer, it suffices to consider a vector x with Ax = 
and a set ^ S C {l,...,n} with \S\ < k that tightly 
satisfy (12) for ak = 1. This implies that S equals the support 
of x. Thus, 1 < ||x||o < k. Clearly, since x is contained in the 
nullspace of A, it can be assumed to be rational with encoding 
length polynomially bounded by the size of A. This shows that 
the "no" answer can be certified in polynomial time. 

To show hardness, we claim that the matrix A has a circuit 
of size at most k if and only if there does not exist any a k < 1 
such that (2) holds. Since the former problem is strongly NP- 
complete by Theorem 1, this completes the proof. 



Assume A has a circuit of size at most k. Then there exists 
a vector x in the nullspace of A with 1 < ||a;||o < k. It 
follows that 1 1 as 1 1 fc, i = || as || i - Thus, (2) implies a k > 1. Since, 
trivially, a k < 1, we conclude that a k = 1. 

Conversely, assume that there exists no a k < 1 such that (2) 
holds for A and k. This implies that there is a vector x with 
Ax = and 1 < ||x||o < k such that ||a:||fc,i = ||a;||i, because 
otherwise a k < 1 would be possible. But this means that the 
support of x contains a circuit of A of size at most k, which 
shows the claim. ■ 

We immediately obtain the following. 

Corollary 7: Given a matrix A £ Q" IX ™ and a positive 
integer fc, it is strongly NP-hard to compute the nullspace 
constant a k . 

V. Concluding remarks 

The results of this paper show that it is CoNP-complete (in 
the strong sense) to answer the following questions in the case 
7=1: "Given a matrix A and a positive integer fc, does the 
RIP or NSP hold with some constant < 7?" It is important to 
note that our results do not imply the hardness for every fixed 
constant 7 < 1. (Note that such questions become solvable 
in time 0(n poly ^) if fc is fixed.) For instance, Corollary 6 
asserts that it is strongly CoNP-complete to certify the RIP 
for given A, k and S k £ (0,1) in general. The actual 6k 
appearing in the proof, however, is very close to 1 and thus 
far from values of 7 that could yield recovery guarantees; cf. 
the remarks following [20, Theorem 4]. Similarly, the NSP 
guarantees l§-l\-equivalence for a k < 1/2, while we proved 
hardness only for deciding whether a k < 1 (which implies 
that computing a k is hard). The complexity of these related 
questions remain open. Nevertheless, our results provide a 
justification to investigate general approximation algorithms 
(which compute bounds on 6 k or a k ), as done in [28], [17], 
[26], [18], instead of searching for exact polynomial time 
algorithms. 

One could also investigate whether it is hard to approximate 
the constants associated with the RIP or NSP in polynomial 
time to within some factor; strong inapproximability results 
for (P ) appear in [47]. 

Moreover, (co)NP-hardness does not necessarily exclude 
the possibility of practically efficient algorithms. So far, to 
the best of our knowledge, the focus has been laid largely on 
relaxations or heuristics. In [19], it was shown that one can 
do better than exhaustive search to certify the RIP, making 
use of the nondecreasing mono tonicity of 6 k with growing fc. 
More work on exact algorithms could shed more light on the 
behavior of the RIP and NSP. 
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